Speech Technology and Systems in Human

نویسندگان

Li Deng

Kuansan Wang

Wu Chou

چکیده

S peech technology and systems in human-machine communication have witnessed a steady and remarkable advancement over the last two decades. Fundamental changes have taken place from theoretical foundations to practical systems, from laboratory prototypes to commercial products, and from proprietary softwares to industrial standards. As the information age continues, research in speech technology is further accelerated by the advent of powerful computing devices, the data-driven pattern recognition methods, and the need to generate machine understandable metadata for Web contents and other information sources. Although various systems are built and applied to numerous applications, the full potential of speech technology still remains to be uncovered. This special section fills the need of a comprehensive review of new approaches and advances of speech technology under a broad perspective of intelligent humanmachine communication. Speech technology and systems touch upon many essential signal processing techniques and are in the core of multimodal/ multimedia communication research. We hope that such a systematic and upto-date overview of the field can bring the awareness and applications of speech technology closer to the general signal processing community. New research trends and directions in the field of speech technology and human-machine communication systems have been evolving rapidly in recent years due to the changing business environment and technology advances. With the Internet and the Web, an increasingly large amount of voice and speech data is made available. This, together with fast computing devices, leads to a new wave of advances on speech “document” understanding and multimedia/multimodal content search. New algorithms are being studied, many of which may not have been computationally feasible in the old days. Also, the large deployment of voice over IP (VoIP) has revitalized the research on noise-robust speech processing and recognition over the IP network, which is a very different environment than in the past. While scientific rigor remains the paramount selection criterion, an attempt is made to provide a balanced coverage of new research trends among articles selected for this special section. About one year ago, we put out the call for papers for articles about speech technology and human-machine communication. We received a large number of submissions, and we would like to thank all authors for their submission. After a peer-review process, nine articles were selected that provide a comprehensive overview of the landscape in speech technology and human-machine communication. A brief overview of the selected articles is provided below in the context of the general theme of this special issue—human-machine communication.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques

One of the interesting topics on multimedia domain is concerned with empowering computer in order to speech production. Speech synthesis is granting human abilities to the computer for speech production. Data-based approach and process-based approach are the two main approaches on speech synthesis. Each approach has its varied challenges. Unit-selection speech synthesis and statistical parametr...

متن کامل

Recognizing the Emotional State Changes in Human Utterance by a Learning Statistical Method based on Gaussian Mixture Model

Speech is one of the most opulent and instant methods to express emotional characteristics of human beings, which conveys the cognitive and semantic concepts among humans. In this study, a statistical-based method for emotional recognition of speech signals is proposed, and a learning approach is introduced, which is based on the statistical model to classify internal feelings of the utterance....

متن کامل

Teaching approaches to Computer Assisted Language Learning

Computers have been used for language teaching ever since the 1960's.Learning a second language is a challenging endeavor, and, for decades now, proponents of computer assisted language learning (CALL) have declared that help is on the horison. We investigate the suitability of deploying speech technology in computer based systems that can be used to teach foreign language skills. In this case,...

متن کامل

Developing a Standardized Medical Speech Recognition Database for Reconstructive Hand Surgery

Fast and holistic access to the patients’ clinical record is a major requirement of modern medical decision support systems (DSS). While electronic health records (EHRs) have replaced the traditional paper-based records in most healthcare organization, the data entry into these systems remains largely manual. Speech recognition technology promises substitution of the more convenient speech-base...

متن کامل

A Study of the Features and Functions of speech Perseverance (With an Emphasis on the Alavi Teachings)

The serious challenge that contemporary human is encountered with has been brought about by the lack of applying ethical and behavioral necessities in his life rather than by the weakness of the rules or lack of technology. One of the mentioned important necessities is the factor of speech perseverance which has a particular conceptual and meaningful weight that is the adducing of the right spe...

متن کامل

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2009

Speech Technology and Systems in Human

نویسندگان

چکیده

منابع مشابه

Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques

Recognizing the Emotional State Changes in Human Utterance by a Learning Statistical Method based on Gaussian Mixture Model

Teaching approaches to Computer Assisted Language Learning

Developing a Standardized Medical Speech Recognition Database for Reconstructive Hand Surgery

A Study of the Features and Functions of speech Perseverance (With an Emphasis on the Alavi Teachings)

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

عنوان ژورنال:

اشتراک گذاری